Automation and Validation of Annotation for Hindi Anaphora Resolution
Abstract
Annotation is the process of labelling text from any language genre so that useful information can be extracted from it; it provides syntactic information about a word or phrase. This paper presents an algorithm for semiautomatic annotation of Hindi text, intended solely to support anaphora resolution. The study was conducted on twelve files of Ranchi Express available in the EMILLE corpus. The corpus is originally tagged for demonstrative pronouns, and the detection of these pronouns is supported by seven tags. However, the original corpus does not support the semantic interpretation of demonstrative pronouns. This paper attempts to automate the tagging process and to handle semantic information through additional tags. The evaluation covered 1485 demonstrative pronouns. The average precision, recall, and F-measure are 74, 71, and 72, respectively.
Keywords: Annotation; natural language processing; demonstrative pronoun; semantic category; indirect anaphora; semiautomatic annotation
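The abstract reports average precision, recall, and F-measure but does not say how the averages were formed. As a minimal sketch, assuming per-file counts of correct, spurious, and missed tags and an unweighted (macro) average over files, the three scores could be computed as follows. The `FileCounts` class, the function names, and the averaging scheme are illustrative assumptions, not the authors' implementation.

```python
from dataclasses import dataclass


@dataclass
class FileCounts:
    """Hypothetical per-file tallies for tagged demonstrative pronouns."""
    true_positives: int   # tags that agree with the reference annotation
    false_positives: int  # tags assigned that the reference annotation rejects
    false_negatives: int  # reference tags that the tagger missed


def precision_recall_f(counts: FileCounts) -> tuple[float, float, float]:
    """Return (precision, recall, F-measure) as fractions in [0, 1]."""
    tp = counts.true_positives
    fp = counts.false_positives
    fn = counts.false_negatives
    # Guard against empty denominators so an empty file scores 0 instead of raising.
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    # F-measure is the harmonic mean of precision and recall.
    f_measure = (2 * precision * recall / (precision + recall)
                 if precision + recall else 0.0)
    return precision, recall, f_measure


def macro_average(per_file: list[FileCounts]) -> tuple[float, float, float]:
    """Average each metric over files, giving every file equal weight (assumed scheme)."""
    if not per_file:
        raise ValueError("need at least one file")
    scores = [precision_recall_f(c) for c in per_file]
    n = len(scores)
    return tuple(sum(s[i] for s in scores) / n for i in range(3))
```

Multiplying the returned fractions by 100 gives values on the same scale as the figures quoted in the abstract. A micro-average, which pools the counts across files before computing the scores, would be an equally plausible reading and can give different numbers.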
Similar resources
Anaphora Annotation in Hindi Dependency TreeBank
In this paper, we propose a scheme for anaphora annotation in Hindi Dependency Treebank. The goal is to identify and handle the challenges that arise in the annotation of reference relations in Hindi. We identify some of the issues related to anaphora annotation specific to Hindi such as distribution of markable span, sequential annotation, representation format, annotation of multiple referent...
Animacy Annotation in the Hindi Treebank
In this paper, we discuss our efforts to annotate nominals in the Hindi Treebank with the semantic property of animacy. Although the treebank already encodes lexical information at a number of levels such as morph and part of speech, the addition of animacy information seems promising given its relevance to varied linguistic phenomena. The suggestion is based on the theoretical and computationa...
Event Anaphora Resolution in Natural Language Processing for Hindi text
This paper presents a comprehensive study of anaphora resolution. Anaphora resolution can be of several types, such as entity anaphora resolution and event anaphora resolution. Event anaphora resolution is the main focus of this review paper. The resolution of event anaphora can differ from language to language, as the structure of the language changes. Here the literature about anapho...
A Machine Learning Approach to Resolve Event Anaphora
Anaphora resolution is considered one of the important areas in the field of Natural Language Processing. A lot of research has been done on anaphora resolution for the English language, but for Hindi the research is limited. Most research on Hindi anaphora resolution addresses entity anaphora resolution. This paper presents a machine learning approach which can work for event a...
Pronominal Reference Type Identification and Event Anaphora Resolution for Hindi
In this paper, we present hybrid approaches for pronominal reference type (abstract or concrete) identification and event anaphora resolution for Hindi. Pronominal reference type identification is an important part of any anaphora resolution system, as it helps the anaphora resolver select optimal features based on pronominal reference types. We use language specific rules and feature...
Publication date: 2015